This page is part of the FHIR Specification (v4.0.1: R4 - Mixed Normative and STU) in it's permanent home (it will always be available at this URL). The current version which supercedes this version is 5.0.0. For a full list of available versions, see the Directory of published versions . Page versions: R4 R3 R2

4.3.15.211 HL7 Version 2 Table 0211

Vocabulary

Work Group

Maturity Level: N/A

External

Use Context: Any

This is a table defined as part of HL7 v2 .

Summary

Code System URL: http://terminology.hl7.org/CodeSystem/v2-0211

Value Set URL: http://terminology.hl7.org/ValueSet/v2-0211

Version: 2.9

Name: v2.0211

Title: v2 Alternate Character Sets

Definition:

FHIR Value set/code system definition for HL7 v2 table 0211 ( Alternate Character Sets)

OID: ??

CodeSystem Resource XML / JSON

ValueSet Resource XML / JSON

Note: V2 code systems may or may not be case sensitive. V2 Code systems will have the CodeSystem.caseSensitive correctly populated in a future version of this specification.

This value set is not currently used

Properties

Code

URL

Description

Type

status

http://hl7.org/fhir/concept-properties#status

A property that indicates the status of the concept. One of active, experimental, deprecated, retired

code

deprecationDate

http://hl7.org/fhir/concept-properties#deprecationDate

The date at which a concept was deprecated. Concepts that are deprecated but not inactive can still be used, but their use is discouraged

dateTime

This code system http://terminology.hl7.org/CodeSystem/v2-0211 defines the following codes:

Code	Display	Definition	Deprecated	Comments	Version
8859/1	en: The printable characters from the ISO 8859/1 Character set nl: De leesbare tekens uit de ISO 8859/1 tekenset				added v2.3
8859/15	en: The printable characters from the ISO 8859/15 (Latin-15) nl: De leesbare tekens uit de ISO 8859/15 tekenset (Latin-15)				added v2.6
8859/2	en: The printable characters from the ISO 8859/2 Character set nl: De leesbare tekens uit de ISO 8859/2 tekenset				added v2.3
8859/3	en: The printable characters from the ISO 8859/3 Character set nl: De leesbare tekens uit de ISO 8859/3 tekenset				added v2.3
8859/4	en: The printable characters from the ISO 8859/4 Character set nl: De leesbare tekens uit de ISO 8859/4 tekenset				added v2.3
8859/5	en: The printable characters from the ISO 8859/5 Character set nl: De leesbare tekens uit de ISO 8859/5 tekenset				added v2.3
8859/6	en: The printable characters from the ISO 8859/6 Character set nl: De leesbare tekens uit de ISO 8859/6 tekenset				added v2.3
8859/7	en: The printable characters from the ISO 8859/7 Character set nl: De leesbare tekens uit de ISO 8859/7 tekenset				added v2.3
8859/8	en: The printable characters from the ISO 8859/8 Character set nl: De leesbare tekens uit de ISO 8859/8 tekenset				added v2.3
8859/9	en: The printable characters from the ISO 8859/9 Character set nl: De leesbare tekens uit de ISO 8859/9 tekenset				added v2.3
ASCII	en: The printable 7-bit ASCII character set. nl: De leesbare 7-bit ASCII tekenset.			en: (This is the default if this field is omitted) nl: Dit is de standaardwaarde als het veld wordt leeggelaten.	added v2.3
BIG-5	en: Code for Taiwanese Character Set (BIG-5) nl: Code voor Taiwanese tekenset (BIG-5)			en: Does not need an escape sequence. BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is “0”. The parser knows that when the top bit of the byte is “0”, the character set is ASCII. When it is “1”, the following bytes should be handled as 2 bytes (or more). No escape technique is needed. However, since some servers do not correctly interpret when they receive a top bit “1”, it is advised, in internet RFC, to not use these kind of non-safe non-escape extension. nl: Heeft geen escape-sequentie nodig. BIG-5 heeft geen escape-sequentie nodig. ASCII is een 7-bit tekenset wat betekent dat het top bit van de byte "0" is. De parser weet dat wanneer de top bit van de byte "0" is, de tekenset ASCII is. Als het "1" is, moeten de volgende bytes in paren van 2 bytes (of meer) worden verwerkt. Er is geen escape techniek vereist. Echter, omdat sommige servers niet goed verwerken bij een top bit van "1", wordt geadviseerd in de internet RFC, om dit type onveilige extensie, zonder escaping, niet te gebruiken.	added v2.5
CNS 11643-1992	en: Code for Taiwanese Character Set (CNS 11643-1992) nl: Code voor Taiwanese tekenset (CNS 11643-1992)			en: Does not need an escape sequence. nl: Heeft geen escape-sequentie nodig.	added v2.5
GB 18030-2000	en: Code for Chinese Character Set (GB 18030-2000) nl: Code voor Chinese tekenset (GB 18030-2000)			en: Does not need an escape sequence. nl: Heeft geen escape-sequentie nodig.	added v2.5
ISO IR14	en: Code for Information Exchange (one byte)(JIS X 0201-1976). nl: Code voor informatieuitwisseling (één byte)(JIS X 0201-1976).			en: Note that the code contains a space, i.e., "ISO IR14". nl: Merk op dat de code een spatie bevat, dus "ISO IR14".	added v2.3.1
ISO IR159	en: Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990). nl: Code voor het aanvullende Japanse ideogramschrift voor informatieuitwisseling (JIS X 0212-1990).			en: Note that the code contains a space, i.e., "ISO IR159". nl: Merk op dat de code een spatie bevat, dus "ISO IR159".	added v2.3.1
ISO IR6	en: ASCII graphic character set consisting of 94 characters. nl: ASCII ideogramtekenset bestaande uit 94 tekens.			http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf	added v2.7
ISO IR87	en: Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990), nl: Code voor de Japanse ideogramschrift voor informatieuitwisseling (JIS X 0208-1990).			en: Note that the code contains a space, i.e., “ISO IR87”. The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence “escape” $ B (in HEX, 1B 24 42) lets the parser know that following bytes should be handled 2-byte wise. Back to ASCII is 1B 28 42. nl: Merk op dat de code een spatie bevat, dus "ISO IR87". De JIS X 0208 heeft een escape-sequentie nodig. In Japan is de escape-techniek ISO 2020. Vanuit basiaal ASCII laat escape-sequentie "escape" $ B (in HEX, 1B 24 42) de parser weten dat de volgende bytes in paren van 2 bytes moeten worden verwerkt. Terug in ASCII is dat dan 1B 28 42.	added v2.3.1
JAS2020	en: A subset of ISO2020 used for most Kanjii transmissions nl: Een subset van ISO 2020 meestal gebruikt voor Kanjii verzendingen		Deprecated		added v2.3, removed after v2.3
JIS X 0202	en: ISO 2022 with escape sequences for Kanjii nl: ISO 2022 met escape-sequenties voor Kanjii		Deprecated		added v2.3, removed after v2.3
KS X 1001	en: Code for Korean Character Set (KS X 1001) nl: Code voor Koreaanse tekenset (KS X 1001)				added v2.5
UNICODE	en: The world wide character standard from ISO/IEC 10646-1-1993 nl: De wereldwijde tekensetstandaard van ISO/IEC 10646-1-1993			en: Deprecated. Retained for backward compatibility only as v 2.5. Replaced by specific Unicode encoding codes. nl: Vervallen. Behouden voor backward compatibility sinds v2.5. Vervangen door specifieke codes voor Unicode tekstcodering.	added v2.3
UNICODE UTF-16	en: UCS Transformation Format, 16-bit form nl: UCS Transformatieformaat, 16-bits vorm		Deprecated	en: UTF-16 is identical to ISO/IEC 10646 UCS-2. Note that the code contains a space before UTF but not before and after the hyphen. nl: UTF-16 is identiek aan ISO/IEC 10646 UCS-2. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken.	added v2.5, removed after v2.7.1
UNICODE UTF-32	en: UCS Transformation Format, 32-bit form nl: UCS Transformatieformaat, 32-bits vorm		Deprecated	en: UTF-32 is defined by Unicode Technical Report #19, and is an officially recognized encoding as of Unicode Version 3.1. UTF-32 is a proper subset of ISO/IEC 10646 UCS-4. Note that the code contains a space before UTF but not before and after the hyphen. nl: UTF-32 wordt gedefinieerd door Unicode Technical Report #19 en is een officieel erkende codering sinds Unicode versie 3.1. UTF-32 is een geldige subset van ISO/IEC 10646 UCS-4. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken.	added v2.5, removed after v2.7.1
UNICODE UTF-8	en: UCS Transformation Format, 8-bit form nl: UCS Transformatieformaat, 8-bits vorm			en: UTF-8 is a variable-length encoding, each code value is represented by 1,2 or 3 bytes, depending on the code value. 7 bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 represents the full UNICODE character set, the following restriction apply to its use: 1. UTF-8 must be the default encoding of the message, UTF-8 cannot be specified as an additional character set in MSH-18 2. There are no other character sets allowed in a message where UTF-8 is the default encoding in the message. In other words, UNICODE UTF-8 can only be specified as a single value in MSH-18 3. A message encoded in UTF-8 must not use a Byte Order Mark (BOM). nl: UTF-8 is een tekstcodering met variabele lengte, waarbij iedere codewaarde wordt gerepresenteerd door 1, 2, of 3 bytes, afhankelijk van de codewaarde. 7-bits ASCII is een geldige subset van UTF-8. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken. Aangezien UTF-8 de volledige UNICODE tekenset vertegenwoordigt, zijn de volgende restricties van toepassing op gebruik:<p/>1. UTF-8 moet de standaardcodering zijn van het bericht, UTF-8 kan niet worden gespecificeerd als extra tekenset in MSH-18<p/>2. Er zijn geen andere tekensets toegestaan in een bericht waarin UTF-8 is standaard tekstcodering is.<p/>Met andere woorden, UNICODE UTF-8 kan niet met andere waarden voorkomen in MSH-18<p/>3. Een bericht in UTF-8 mag <b>geen</b> Byte Order Mark (BOM) bevatten.	added v2.5

Properties

Code

URL

Description

Type

status

http://hl7.org/fhir/concept-properties#status

A property that indicates the status of the concept. One of active, experimental, deprecated, retired

code

deprecationDate

http://hl7.org/fhir/concept-properties#deprecationDate

The date at which a concept was deprecated. Concepts that are deprecated but not inactive can still be used, but their use is discouraged

dateTime

This code system http://terminology.hl7.org/CodeSystem/v2-0211 defines the following codes:

Code	Display	Definition	Deprecated	Comments	Version
8859/1	The printable characters from the ISO 8859/1 Character set				added v2.3
8859/15	The printable characters from the ISO 8859/15 (Latin-15)				added v2.6
8859/2	The printable characters from the ISO 8859/2 Character set				added v2.3
8859/3	The printable characters from the ISO 8859/3 Character set				added v2.3
8859/4	The printable characters from the ISO 8859/4 Character set				added v2.3
8859/5	The printable characters from the ISO 8859/5 Character set				added v2.3
8859/6	The printable characters from the ISO 8859/6 Character set				added v2.3
8859/7	The printable characters from the ISO 8859/7 Character set				added v2.3
8859/8	The printable characters from the ISO 8859/8 Character set				added v2.3
8859/9	The printable characters from the ISO 8859/9 Character set				added v2.3
ASCII	The printable 7-bit ASCII character set.			(This is the default if this field is omitted)	added v2.3
BIG-5	Code for Taiwanese Character Set (BIG-5)			Does not need an escape sequence. BIG-5 does not need an escape sequence. ASCII is a 7 bit character set, which means that the top bit of the byte is “0”. The parser knows that when the top bit of the byte is “0”, the character set is ASCII. When it is “1”, the following bytes should be handled as 2 bytes (or more). No escape technique is needed. However, since some servers do not correctly interpret when they receive a top bit “1”, it is advised, in internet RFC, to not use these kind of non-safe non-escape extension.	added v2.5
CNS 11643-1992	Code for Taiwanese Character Set (CNS 11643-1992)			Does not need an escape sequence.	added v2.5
GB 18030-2000	Code for Chinese Character Set (GB 18030-2000)			Does not need an escape sequence.	added v2.5
ISO IR14	Code for Information Exchange (one byte)(JIS X 0201-1976).			Note that the code contains a space, i.e., "ISO IR14".	added v2.3.1
ISO IR159	Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990).			Note that the code contains a space, i.e., "ISO IR159".	added v2.3.1
ISO IR6	ASCII graphic character set consisting of 94 characters.			http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf	added v2.7
ISO IR87	Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990),			Note that the code contains a space, i.e., “ISO IR87”. The JIS X 0208 needs an escape sequence. In Japan, the escape technique is ISO 2022. From basic ASCII, escape sequence “escape” $ B (in HEX, 1B 24 42) lets the parser know that following bytes should be handled 2-byte wise. Back to ASCII is 1B 28 42.	added v2.3.1
JAS2020	A subset of ISO2020 used for most Kanjii transmissions		Deprecated		added v2.3, removed after v2.3
JIS X 0202	ISO 2022 with escape sequences for Kanjii		Deprecated		added v2.3, removed after v2.3
KS X 1001	Code for Korean Character Set (KS X 1001)				added v2.5
UNICODE	The world wide character standard from ISO/IEC 10646-1-1993			Deprecated. Retained for backward compatibility only as v 2.5. Replaced by specific Unicode encoding codes.	added v2.3
UNICODE UTF-16	UCS Transformation Format, 16-bit form		Deprecated	UTF-16 is identical to ISO/IEC 10646 UCS-2. Note that the code contains a space before UTF but not before and after the hyphen.	added v2.5, removed after v2.7.1
UNICODE UTF-32	UCS Transformation Format, 32-bit form		Deprecated	UTF-32 is defined by Unicode Technical Report #19, and is an officially recognized encoding as of Unicode Version 3.1. UTF-32 is a proper subset of ISO/IEC 10646 UCS-4. Note that the code contains a space before UTF but not before and after the hyphen.	added v2.5, removed after v2.7.1
UNICODE UTF-8	UCS Transformation Format, 8-bit form			UTF-8 is a variable-length encoding, each code value is represented by 1,2 or 3 bytes, depending on the code value. 7 bit ASCII is a proper subset of UTF-8. Note that the code contains a space before UTF but not before and after the hyphen. Since UTF-8 represents the full UNICODE character set, the following restriction apply to its use: 1. UTF-8 must be the default encoding of the message, UTF-8 cannot be specified as an additional character set in MSH-18 2. There are no other character sets allowed in a message where UTF-8 is the default encoding in the message. In other words, UNICODE UTF-8 can only be specified as a single value in MSH-18 3. A message encoded in UTF-8 must not use a Byte Order Mark (BOM).	added v2.5

Eigenschappen

Code

URL

Omschrijving

Type

status

http://hl7.org/fhir/concept-properties#status

A property that indicates the status of the concept. One of active, experimental, deprecated, retired

code

deprecationDate

http://hl7.org/fhir/concept-properties#deprecationDate

The date at which a concept was deprecated. Concepts that are deprecated but not inactive can still be used, but their use is discouraged

dateTime

Dit code systeem http://terminology.hl7.org/CodeSystem/v2-0211 definieert de volgende codes:

Code	Display	Definition	Verouderd	Commentaar	Versie
8859/1	De leesbare tekens uit de ISO 8859/1 tekenset				added v2.3
8859/15	De leesbare tekens uit de ISO 8859/15 tekenset (Latin-15)				added v2.6
8859/2	De leesbare tekens uit de ISO 8859/2 tekenset				added v2.3
8859/3	De leesbare tekens uit de ISO 8859/3 tekenset				added v2.3
8859/4	De leesbare tekens uit de ISO 8859/4 tekenset				added v2.3
8859/5	De leesbare tekens uit de ISO 8859/5 tekenset				added v2.3
8859/6	De leesbare tekens uit de ISO 8859/6 tekenset				added v2.3
8859/7	De leesbare tekens uit de ISO 8859/7 tekenset				added v2.3
8859/8	De leesbare tekens uit de ISO 8859/8 tekenset				added v2.3
8859/9	De leesbare tekens uit de ISO 8859/9 tekenset				added v2.3
ASCII	De leesbare 7-bit ASCII tekenset.			Dit is de standaardwaarde als het veld wordt leeggelaten.	added v2.3
BIG-5	Code voor Taiwanese tekenset (BIG-5)			Heeft geen escape-sequentie nodig. BIG-5 heeft geen escape-sequentie nodig. ASCII is een 7-bit tekenset wat betekent dat het top bit van de byte "0" is. De parser weet dat wanneer de top bit van de byte "0" is, de tekenset ASCII is. Als het "1" is, moeten de volgende bytes in paren van 2 bytes (of meer) worden verwerkt. Er is geen escape techniek vereist. Echter, omdat sommige servers niet goed verwerken bij een top bit van "1", wordt geadviseerd in de internet RFC, om dit type onveilige extensie, zonder escaping, niet te gebruiken.	added v2.5
CNS 11643-1992	Code voor Taiwanese tekenset (CNS 11643-1992)			Heeft geen escape-sequentie nodig.	added v2.5
GB 18030-2000	Code voor Chinese tekenset (GB 18030-2000)			Heeft geen escape-sequentie nodig.	added v2.5
ISO IR14	Code voor informatieuitwisseling (één byte)(JIS X 0201-1976).			Merk op dat de code een spatie bevat, dus "ISO IR14".	added v2.3.1
ISO IR159	Code voor het aanvullende Japanse ideogramschrift voor informatieuitwisseling (JIS X 0212-1990).			Merk op dat de code een spatie bevat, dus "ISO IR159".	added v2.3.1
ISO IR6	ASCII ideogramtekenset bestaande uit 94 tekens.			http://www.itscj.ipsj.or.jp/ISO-IR/006.pdf	added v2.7
ISO IR87	Code voor de Japanse ideogramschrift voor informatieuitwisseling (JIS X 0208-1990).			Merk op dat de code een spatie bevat, dus "ISO IR87". De JIS X 0208 heeft een escape-sequentie nodig. In Japan is de escape-techniek ISO 2020. Vanuit basiaal ASCII laat escape-sequentie "escape" $ B (in HEX, 1B 24 42) de parser weten dat de volgende bytes in paren van 2 bytes moeten worden verwerkt. Terug in ASCII is dat dan 1B 28 42.	added v2.3.1
JAS2020	Een subset van ISO 2020 meestal gebruikt voor Kanjii verzendingen		Verouderd		added v2.3, removed after v2.3
JIS X 0202	ISO 2022 met escape-sequenties voor Kanjii		Verouderd		added v2.3, removed after v2.3
KS X 1001	Code voor Koreaanse tekenset (KS X 1001)				added v2.5
UNICODE	De wereldwijde tekensetstandaard van ISO/IEC 10646-1-1993			Vervallen. Behouden voor backward compatibility sinds v2.5. Vervangen door specifieke codes voor Unicode tekstcodering.	added v2.3
UNICODE UTF-16	UCS Transformatieformaat, 16-bits vorm		Verouderd	UTF-16 is identiek aan ISO/IEC 10646 UCS-2. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken.	added v2.5, removed after v2.7.1
UNICODE UTF-32	UCS Transformatieformaat, 32-bits vorm		Verouderd	UTF-32 wordt gedefinieerd door Unicode Technical Report #19 en is een officieel erkende codering sinds Unicode versie 3.1. UTF-32 is een geldige subset van ISO/IEC 10646 UCS-4. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken.	added v2.5, removed after v2.7.1
UNICODE UTF-8	UCS Transformatieformaat, 8-bits vorm			UTF-8 is een tekstcodering met variabele lengte, waarbij iedere codewaarde wordt gerepresenteerd door 1, 2, of 3 bytes, afhankelijk van de codewaarde. 7-bits ASCII is een geldige subset van UTF-8. Merk op dat de code een spatie bevat voor UTF maar niet voor en na het koppelteken. Aangezien UTF-8 de volledige UNICODE tekenset vertegenwoordigt, zijn de volgende restricties van toepassing op gebruik:<p/>1. UTF-8 moet de standaardcodering zijn van het bericht, UTF-8 kan niet worden gespecificeerd als extra tekenset in MSH-18<p/>2. Er zijn geen andere tekensets toegestaan in een bericht waarin UTF-8 is standaard tekstcodering is.<p/>Met andere woorden, UNICODE UTF-8 kan niet met andere waarden voorkomen in MSH-18<p/>3. Een bericht in UTF-8 mag <b>geen</b> Byte Order Mark (BOM) bevatten.	added v2.5

A current expansion is provided for this table because it is not simply 'all codes in the table', or because some codes have been deprecated

4.3.15.211.2 Expansion

This expansion generated 01 Nov 2019

This value set contains 21 concepts

Expansion based on http://terminology.hl7.org/CodeSystem/v2-0211 version 2.9

All codes from system http://terminology.hl7.org/CodeSystem/v2-0211

Code	Display	Definition
8859/1	The printable characters from the ISO 8859/1 Character set
8859/15	The printable characters from the ISO 8859/15 (Latin-15)
8859/2	The printable characters from the ISO 8859/2 Character set
8859/3	The printable characters from the ISO 8859/3 Character set
8859/4	The printable characters from the ISO 8859/4 Character set
8859/5	The printable characters from the ISO 8859/5 Character set
8859/6	The printable characters from the ISO 8859/6 Character set
8859/7	The printable characters from the ISO 8859/7 Character set
8859/8	The printable characters from the ISO 8859/8 Character set
8859/9	The printable characters from the ISO 8859/9 Character set
ASCII	The printable 7-bit ASCII character set.
BIG-5	Code for Taiwanese Character Set (BIG-5)
CNS 11643-1992	Code for Taiwanese Character Set (CNS 11643-1992)
GB 18030-2000	Code for Chinese Character Set (GB 18030-2000)
ISO IR14	Code for Information Exchange (one byte)(JIS X 0201-1976).
ISO IR159	Code of the supplementary Japanese Graphic Character set for information interchange (JIS X 0212-1990).
ISO IR6	ASCII graphic character set consisting of 94 characters.
ISO IR87	Code for the Japanese Graphic Character set for information interchange (JIS X 0208-1990),
KS X 1001	Code for Korean Character Set (KS X 1001)
UNICODE	The world wide character standard from ISO/IEC 10646-1-1993
UNICODE UTF-8	UCS Transformation Format, 8-bit form

Additional Language Displays

Code	Nederlands (Dutch, nl)
8859/1	De leesbare tekens uit de ISO 8859/1 tekenset
8859/15	De leesbare tekens uit de ISO 8859/15 tekenset (Latin-15)
8859/2	De leesbare tekens uit de ISO 8859/2 tekenset
8859/3	De leesbare tekens uit de ISO 8859/3 tekenset
8859/4	De leesbare tekens uit de ISO 8859/4 tekenset
8859/5	De leesbare tekens uit de ISO 8859/5 tekenset
8859/6	De leesbare tekens uit de ISO 8859/6 tekenset
8859/7	De leesbare tekens uit de ISO 8859/7 tekenset
8859/8	De leesbare tekens uit de ISO 8859/8 tekenset
8859/9	De leesbare tekens uit de ISO 8859/9 tekenset
ASCII	De leesbare 7-bit ASCII tekenset.
BIG-5	Code voor Taiwanese tekenset (BIG-5)
CNS 11643-1992	Code voor Taiwanese tekenset (CNS 11643-1992)
GB 18030-2000	Code voor Chinese tekenset (GB 18030-2000)
ISO IR14	Code voor informatieuitwisseling (één byte)(JIS X 0201-1976).
ISO IR159	Code voor het aanvullende Japanse ideogramschrift voor informatieuitwisseling (JIS X 0212-1990).
ISO IR6	ASCII ideogramtekenset bestaande uit 94 tekens.
ISO IR87	Code voor de Japanse ideogramschrift voor informatieuitwisseling (JIS X 0208-1990).
KS X 1001	Code voor Koreaanse tekenset (KS X 1001)
UNICODE	De wereldwijde tekensetstandaard van ISO/IEC 10646-1-1993
UNICODE UTF-8	UCS Transformatieformaat, 8-bits vorm